Dec 16 New evaluation completed — GPT-4o achieves 74.4% accuracy on SEC enforcement predictions.

Model Performance Comparison

GPT-4o Best
64.9%
Overall Accuracy
Resolution38.6%
Monetary53.0%
Injunction78.8%
Officer Bar89.2%
500 cases evaluated
Claude Opus 4
46.8%
Overall Accuracy
Resolution38.6%
Monetary23.5%
Injunction79.8%
Officer Bar92.0%
500 cases evaluated
Gemini 2.0
Coming Soon

Showing GPT-4o predictions on 500 evaluated cases below.

Matter
Agency
Type
Filed
Status
Score